One-step extrapolation of the prediction performance of a gene signature derived from a small study

Authors

  • Ling-Yi Wang
  • Wen-Chung Lee
Abstract

OBJECTIVE Microarray-related studies often involve a very large number of genes and a small sample size. Cross-validation or bootstrapping is therefore imperative to obtain a fair assessment of the prediction/classification performance of a gene signature. A deficiency of these methods is that the training sample is effectively smaller than the full sample, because cross-validation partitions the data and bootstrapping samples with replacement. To address this problem, we aim to obtain a prediction performance estimate that strikes a good balance between bias and variance and has a small root mean squared error.

METHODS We propose to make a one-step extrapolation from the fitted learning curve to estimate the prediction/classification performance of the model trained on all the samples.

RESULTS Simulation studies show that the method strikes a good balance between bias and variance and has a small root mean squared error. Three microarray data sets are used for demonstration.

CONCLUSIONS We advocate this method for estimating the prediction performance of a gene signature derived from a small study.
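The idea of extrapolating a fitted learning curve to the full sample size can be illustrated with a minimal sketch. The abstract does not state the functional form of the learning curve or how it is fitted, so everything below is an assumption made for illustration: the curve e(m) = a + b/m, the least-squares fit, and the hypothetical names `extrapolate_one_step`, `train_sizes`, `error_estimates` and `full_size`. It should not be read as the authors' procedure.

```python
import numpy as np


def extrapolate_one_step(train_sizes, error_estimates, full_size):
    """Fit e(m) = a + b/m by least squares and evaluate it at m = full_size.

    The 1/m curve is an assumed form chosen for illustration only.
    """
    m = np.asarray(train_sizes, dtype=float)
    e = np.asarray(error_estimates, dtype=float)
    # Design matrix: an intercept column and a 1/m column.
    X = np.column_stack([np.ones_like(m), 1.0 / m])
    (a, b), *_ = np.linalg.lstsq(X, e, rcond=None)
    # Extrapolate from the reduced training sizes to the full sample size.
    return a + b / full_size


# Synthetic error rates at reduced training sizes (not real data).
sizes = [10, 15, 20, 25, 29]
errors = [0.1 + 2.0 / m for m in sizes]
estimate = extrapolate_one_step(sizes, errors, full_size=30)
```

In practice the error estimates at each reduced training size would come from resampling (for example, repeated cross-validation with different training fractions), and the extrapolated value serves as the performance estimate for a model trained on all the samples.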

Journal title:

Volume 5  Issue 

Pages  -

Publication date 2015